MKQA: A Linguistically Diverse Benchmark for Multilingual Open Domain Question Answering

نویسندگان

چکیده

Abstract Progress in cross-lingual modeling depends on challenging, realistic, and diverse evaluation sets. We introduce Multilingual Knowledge Questions Answers (MKQA), an open- domain question answering set comprising 10k question-answer pairs aligned across 26 typologically languages (260k total). are based heavily curated, language- independent data representation, making results comparable of language-specific passages. With languages, this dataset supplies the widest range to-date for evaluating answering. benchmark a variety state- of-the-art methods baselines generative extractive answering, trained Natural Questions, zero shot translation settings. Results indicate is challenging even English, but especially low-resource languages.1

برای دانلود باید عضویت طلایی داشته باشید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Open-domain question answering

Question answering aims to develop techniques that can go beyond the retrieval of relevant documents in order to return exact answers to natural language questions, such as “How tall is the Eiffel Tower?”, “Which cities have a subway system?”, and “Who is Alberto Tomba?”. Answering natural language questions requires more complex processing of text than employed by current information retrieval...

متن کامل

Linguistically Informed Question Answering

ion over Parts of Speech. Lexical units are grouped into frames irrespective of their parts of speech. This allows to easily map, e. g., two text fragments onto each other that carry essentially the same meaning, but where one is headed by a verb and the other by a noun, such as ‘A bought B’ vs. ‘(the) acquisition of B by A’. In GermaNet, this mapping requires additional knowledge in the form o...

متن کامل

Open-domain textual question answering techniques

Textual question answering is a technique of extracting a sentence or text snippet from a document or document collection that responds directly to a query. Open-domain textual question answering presupposes that questions are natural and unrestricted with respect to topic. The question answering (Q/A) techniques, as embodied in today’s systems, can be roughly divided into two types: (1) techni...

متن کامل

Experiments with Open-Domain Textual Question Answering

This paper describes the integration of several knowledge-based natural language processing techniques into a Question Answering system, capable of mining textual answers from large collections of texts. Surprizing quality is achieved when several lightweight knowledge-based NLP techniques comI)lement mostly shallow, surface-based approaclms.

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Transactions of the Association for Computational Linguistics

سال: 2021

ISSN: ['2307-387X']

DOI: https://doi.org/10.1162/tacl_a_00433